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ABSTRACT 

We present results from a near-infrared narrow-band survey of emission-line galaxies at z — 
2.23, using the Wide Field Camera on the United Kingdom Infrared Telescope. The H2SI 
narrow -band filter (Ac — 2.121/im) we employ selects the Ha emission line redshifted to z = 
2.23, and is thus suitable for selecting 'typical' star forming galaxies and active galactic nuclei 
at this epoch. The pilot study was undertaken in the well studied Cosmological Evolution 
Survey field (COSMOS) and is already the largest near-infrared narrow-band survey at this 
depth, with a line flux limit of F^a ^ 10^^^ ergs^^ cm^-^ over 0.60 square degrees, probing 
~220 X 10^ Mpc'"^ (CO -moving) down to a limiting star formation rate of ^^30 Mq yr ^ (3a). 
In this paper we present the results from our pilot survey and evaluate the Ha luminosity 
function and estimate the clustering properties of Ha emitters at 2: — 2.23 from 55 detected 
galaxies. The integrated luminosity function is used to estimate the volume averaged star 
formation rate at z = 2.23: psFR = 0.17to og Mq yr^^ Mpc^^ for Lrq > lO^'^ergs^i. 
For the first time, we use the Ha star-formation tracer to reliably constrain psFR out to z = 
2.23 demonstrating the rapid increase in psFR out to this redshift as well as confirming the 
flattening in psFR between z ~ 1-2. In addition to the luminosity distribution, we analyse the 
clustering properties of these galaxies. Using the 2-point angular correlation function, uj{9), 
we estimate a real space correlation length of tq = 4.2^0 2 Mpc. In comparison to models 
of clustering which take into account bias evolution, we estimate that these galaxies are hosted 
by dark matter halos of mass A/haio ~ 10^^ Mq consistent with the progenitors of the Milky 
Way. 
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1 INTRODUCTION 

One of the most fundamental, and challenging, goals of mod- 
em observational cosmology is to reliably determine the volume- 
averaged star-formation history of the Universe: its distribution 
function and variation with environment are powerful tools for un- 
derstanding the physics of galaxy formation and evolution. Surveys 
using a range of star-formation indicators suggest that the star- 
formation rate density (psfr) rises as ~(1 + z)^ out to z ~ 1 
(e.g. Lilly et al. 1995) and hence the 'epoch' of galaxy formation 
must occur at 2 > 1. Unfortunately, tying down the precise epoch 
of maximum activity at z > 1 is more difficult, with different star- 
formation indicators giving a wide spread in psfr (e.g. Small et al. 
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2002; Hopkins 2004) - although there is a general trend for pspR to 
plateau at z ~ 2 and hence it is likely that many of the stars seen in 
galaxies at the present-day were formed between z ^ 1-3. Indeed, 
2 2-2.5 appears to be a critical era in the evolution of many 
populations: optical QSO activity (which should be linked to the 
growth of super-massive black holes) peaks at this epoch (Boyle 
et al. 2000), as does the luminosity density in bright submillime- 
tre galaxies - thought to represent an early phase in the formation 
of massive galaxies (Chapman et al. 2005). This behaviour might 
be associated with the formation of spheroids or the most massive 
galaxies. It is not clear that this evolution is true of "typical" star 
forming galaxies (i.e. those with SFRs < 100 Af© yr~^). A sensi- 
tive survey of star forming galaxies in a range of representative 
environments at high-2 is needed to address this issue. 

Variation between the different psfr measurements at high- 
2 is in part due to a combination of the effects of sample selec- 
tion, Cosmic variance and biases in individual indicators. Estimates 
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from ultra-violet-selected samples (e.g. Madau et al. 1996) require 
large corrections for dust extinction and may miss the most ob- 
scured sources altogether. Similarly, although far-infrared-selected 
samples are less influenced by dust extinction, they require large 
extrapolations to account for sources below their bright luminosity 
limits (e.g. Chapman et al. 2005). Thus while no single indicator 
can give an unbiased view of the evolution of the psfr, mixing 
different indicators at different epochs is not helping our under- 
standing. 

What we require is one star-formation indicator that can be 
applied from z = 0-3, is relatively irrmiune to dust extinction and 
which has sufficient sensitivity that psfr estimates do not require 
large extrapolations for faint sources. The Hq emission line lumi- 
nosity is a well-calibrated SFR indicator providing a good measure 
of the instantaneous SFR of young, massive stars (>S Mq; Ken- 
nicutt 1998). Thus Ha combines all of these properties and hence 
is the tool of choice for studying the evolution of the SFR den- 
sity out lo z ^ 3 - covering the expected peak in the SFRD. The 
Ha luminosity function (LF), and the corresponding pspR has been 
measured out to to a ~ 1.3 by a series of investigators (e.g. Gal- 
lego et al. 1995; Tresse et al. 2002; Yan et al. 1999; Shioya et al. 
2008; Villar et al. 2008). These observations confirm the rapid rise 
seen by other indicators out to a ~ 1 (see also the compilation of 
Moorwood 2004). 

At higher redshift attempts to search for Ha emission have 
been made with near-infrared narrow-band surveys (e.g. Thomp- 
son et al. 1996; Moorwood et al. 2000), but these have been ham- 
pered by small survey areas. At the typical limit of these studies, 
Fhc ~ 10~^® ergs~^ cm~^, only a handful of potential « ~ 2 
Ha emitters have been detected. Indeed, in the largest survey of 
this type to date, Thompson et al. (1996) detected only one Ha 
emitter over 276 square arcminutes to a flux limit of Fna = 3.5 x 
10"^®ergs"^cm"^ This was confirmed to have an emission Une 
corresponding to z = 2.43 in the follow-up spectroscopic survey 
of Beckwith et al. (1998). In comparison, Moorwood et al. (2000) 
probe to a slightly deeper limit, Fua = 0-5 x 10~^® erg s~^ cm~'^ 
over 100 square arcminutes, and detect 10 candidate emitters, but 
all of the candidates chosen for spectroscopic follow-up turned out 
to be higher redshift [Oiii]A5007 emitters (Kurk et al. 2004; Moor- 
wood et al. 2003). Detecting large numbers of Ha emitters us- 
ing this method is clearly challenging, and larger (and preferably 
deeper) surveys are needed to efficiently probe the Ha luminosity 
density at high redshift. 

Larger scale narrow-band imaging surveys would also present 
the opportunity to measure the clustering properties of high-a star 
forming galaxy populations. Not only does this provide an impor- 
tant insight into the real space distribution of galaxies, but (assvun- 
ing a model for bias - i.e. how the observed galaxy populations 
trace the underlying matter distribution) can provide an estimate 
of the mass of dark matter halos that host such galaxies. Compar- 
ing the measurements of clustering for populations at high and low 
redshift, it is therefore possible to infer the likely progenitor popu- 
lations of galaxies seen in the local Universe. 

In this paper we make an improvement over previous z ^ 2 
Ha surveys by utilising the panoramic imaging capability of the 
Wide Field Camera (WFCAM, Casali et al. 2007) on the 3.8-m 
United Kingdom Infrared Telescope (UKIRT). We have obtained 
deep narrow-band imaging at A = 2.121/im to search for 2 = 2.23 
Ha emitters over 0.603 sq. degrees of the Cosmological Evolution 
Survey field (COSMOS) (Scoville et al. 2007). This corresponds 
to a co-moving volume ~30x that probed by Moorwood et al. 
(2000), which is the most similar previous survey in redshift cover- 



age and depth to this one. Here we present the results of our pilot 
survey, and use the Ha LF to evaluate the Ha luminosity density at 
z = 2.23, and hence constrain the evolution of psfr. In §2 we out- 
line the details of the survey and reduction techniques, §3 describes 
the narrow-band selection procedure, and in §4 wc present the re- 
sults, discuss some of the individual properties and calculate the lu- 
minosity function for the Ha sample. We compare our results with 
previous estimates for this epoch, and with the latest semi-analytic 
predictions for the Ha luminosity density at this epoch. The in- 
tegrated luminosity function provides us with the psFR, and we 
compare this to other measurements to place our result in a cosmo- 
logical context. We also present the first clustering analysis of Ha 
emitters at high redshift. The two-point angular correlation function 
yields the real-space correlation length, which we compare to other 
populations over cosmic time to gain insight into the likely descen- 
dants of these galaxies. We summarise the work in §5 and give 
a brief overview of our future extension to the survey. Through- 
out we have assumed h — 0.7 in units of lOOkms"^ Mpc~^ and 
(n„i,nA) = (0.3,0.7). Unless otherwise stated, all magnitudes 
are on the Vega scale. 

2 OBSERVATIONS AND DATA REDUCTION 
2.1 Observations 

The COSMOS field was observed during 2006 May, Novem- 
ber and December with WFCAM on UKIRT, with the broad- 
band (Jf-band) and narrow-band (H2SI, Ac = 2.121pm, 6X = 
0.021pm) filters. WFCAM's standard 'paw-print' configuration 
of four 2048x2048 0.4"/pixel detectors offset by ~20' can 
be macro-stepped four times to mosaic a contiguous region of 
~55' X 55' (Casali et al. 2007). In this way we mapped 3 
paw-prints, or '^0.62 square degrees of COSMOS (centred on 
10''00™28.6", -h02'*12'21.0" J2000). Our narrow-band exposure 
time is ~13.5ks/pixel (in order to reach the equivalent continuum 
limit, the broad-band K exposures were ~0.5 ks/pixel). The seeing 
was consistently of the order <l". To help with Cosmic ray rejec- 
tion over the relatively long narrow-band exposures (40 or 60 s), we 
used the NDR (Non Destructive Read) mode, whereas CDS (Cor- 
related Double Sampling) mode was used for the broad-band expo- 
sures. The near-infrared observations were obtained in the standard 
way, following a 14-point jitter pattern. In addition to the jitter pat- 
tern, to improve sampling of the PSF with the 0.4" pixels of WF- 
CAM, the narrow-band frames were microstepped in a 2 x 2 grid 
with 1.2" offsets at each position. We summarise the observations 
in Table 1. 



2.2 WFCAM data reduction 

For a given frame in a jitter sequence of 14 images, we self-flat 
the data using a normalised median combination of the remaining 
frames in the sequence, taking care to mask-out bright objects in 
each frame. This is achieved by creating a two-pass flat: in the first 
pass a rough flat field image is made from a straight median com- 
bination of the data frames in order to reveal the bright objects. 
In the next step, we use SExtractor's check_image output 
(v2.3.2 Bertin & Amouts 1996) to identify regions where bright 
stars and galaxies lie and then mask these when constructing the 
master flat. This minimises the artifacts seen around bright sources. 
Some care must be taken with the micro-stepped frames - the small 
offset between consecutive frames in a micro-sequence are smaller 
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Table 1. Observation log of the COSMOS field. Coordinates are in J2000 and refer to the centre pointing of a WFCAM 'paw-print' . The average seeing in the 
broad and narrow-band frames (stacked frames) is ~ 1 " . 
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Figure 1. The K — H2SI colour-magnitude diagram used to select narrow- 
band excess objects, and thus possible 2 = 2.23 Ha emitters. Narrow-band 
colours have been corrected to account for broad-band continuum slope 
using the (z—K) broad-band colour (see §3.1). S = 2, 2.5, 3 & 5 selection 
cuts are shown, as is a 50A minimum in equivalent width used to select 
significant emitters. The E cuts here are averaged over the whole field area, 
but these change slightly for each WFCAM tile as we take into account 
individual frame zeropoints and noise properties (although there is only a 
small variation over the survey). We show all detections that have would 
been selected as narrow-band excess sources as filled circles (limits indicate 
that source was not detected in the broad-band frame). These narrow-band 
excess sources are subject to a further selection described in more detail in 
§3.2. 

than many extended sources, therefore we construct flats for each 
position in the micro-stepped sequence separately as if they are part 
of a standard dither. 

A world coordinate system is fit to each frame by querying 
the USNOA2.0 catalogue, fitting on average ~100 objects, and 
then frames are aligned and co-added with SWARlf^ (this performs 
a background mesh-based sky subtraction, optimised to provide the 
best representation of the large scale variations across the field). In 
the case of the micro-stepped (narrow-band images), this step is 
preceded by drizzling the frame onto a grid with a pixel scale of 
0.2" (i.e. half the camera pixel scale). 

WFCAM frames suffer from significant cross-talk, manifest- 
ing itself in toroidal features at regular (integer multiples of 128) 

^ Both SWARP and SExtrACTOR are distributed by Terapix 
jhttp : / / terapix . lap . f r) at the Institut d'Astrophysique de 
Paris. 
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Figure 2. Three examples of narrow-band excess sources of various signif- 
icance, S. The top, middle and lower panels show E ~ 5, S ~ 3, S ~ 2.5 
respectively. The left hand panels show the /C-band continuum, the centre 
shows the H2SI narrow-band image, and the light shows the continuum- 
subtracted narrow-band image. The A'-band image has been scaled to ac- 
count for the filter widths and both frames were locally registered with im- 
proved astrometric accuracy and smoothed with a Gaussian kernel to the 
same effective seeing. Images are 30" X 30" square and orientated North 
up and East left and scaled identically. Note that all images have been re- 
binned to the scale of the K-hand frame: 0.4"/pix. 

pixel intervals from sources in the read-out direction. As these are 
tied to the positions of real objects, they are not removed by dither- 
ing the frames. High order cross-talk can easily mimic real objects, 
therefore these positions are flagged at the catalogue stage; we dis- 
cuss this further in §3. 

Broad and narrow-band frames are calibrated by matching 
10 < -ft: < 15 stars from the 2MASS All-Sky Catalogue of Point 
Sources (Cutri et al. 2003) which are unsaturated in our frames. 
Since the narrow-band filter falls in the Jf-band, we can check the 
zero-point offsets to be applied to the narrow-band frames directly 
from the broad-band calibration, taking into account the relative 
widths of the filters, such that the offset between the zeropoints is 
2.5 log(AAif/AAH2Si)- As an additional verification, we check 
that /i' — H2SI ~ as expected using the photometry of the bright 
stars. We estimate our absolute K calibration is good to < 1% from 
the scatter compared to 2MAS^ 



^ An independent data reduction of a sub-set of the science frames by the 
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2.3 Source extraction and survey limits 

Our survey is made up of a mosaic of tliree WFCAM paw-prints, 
i.e. twelve 13.7' x 13.7' tiles. Sources are detected and extracted 
using SEXTRACTOR (Berlin & Amouts 1996). A final narrow-band 
catalogue is made by matching detections in the H2S 1 to the broad- 
band image. Since the frames are well aligned astrometrically, a 
simple geometric matching algorithm suffices, with a maximum 
search radius of 2". Cross-talk artifacts in WFCAM frames can 
mimic the narrow-band excess objects we are selecting for. There- 
fore, to clean the catalogue from possible cross-talk contaminants 
we measure the chip [x, y) position of bright sources, and flag re- 
gions at integer multiples of 128 pixels along the direction of read- 
out. These chip positions can be converted to a sky coordinate and 
then compared to the narrow-band catalogue. As a rule, we remove 
an object from the catalogue if it lies within 2" of any cross-talk 
position. Note that by avoiding the zcroth order cross-talk position, 
we also avoid spurious detections close to bright stars (although 
in this case the exclusion radius is larger to account for extended 
halos and diffraction spikes, at 30"). Due to the lower exposures 
received by the edges of the individual frames (caused by the jit- 
ter pattern), we ignore detections from within ~10" of any frame 
edge. The total area lost due to cross-talk and star masking is neg- 
ligible (< 0.1%), however including the edge-clipping we survey 
an effective area of 0.603 square degrees. 

Due to the fact that the observations were acquired over a long 
period of time, with slightly different seeing and exposure times 
(Table 1), we treat each tile separately, detecting objects down to 
each tile's limit. The average 3fT depth of the broad-band frames is 
K = 20. 1 mag; we detect a total of 19079 objects down to this limit 
across 0.603 sq. degrees. Note that all magnitudes are measured in 
3" diameter apertures, and this recovers the majority of the flux. 

To confirm the depth and completeness of the images, we per- 
form Monte Carlo simulations of the detection of a large number 
of 'fake' sources (a single Gaussian point source generated with 
IRAF's ARTDATA with a FWHM matching the image PSF). By scal- 
ing the artificial source over a range of known flux, and inserting 
at random locations over the frames, we measure the detection rate 
(i.e. the completeness) as a function of observed magnitude using 
an identical extraction procedure to that used to generate the main 
catalogue. We use this detection completeness information to cor- 
rect the derived liuninosity function of Ha emitters. 



3 SELECTION TECHNIQUE 
3.1 Narrow-Band excess selection 

Potential z = 2.23 Ha emitting candidates are initially selected 
based on the significance of their narrow-band excess, i.e. their 
JsT — H2SI colour Galaxies with a strong emission line falling in 
the narrow-band filter will have H2SI < K. The significance of 
the narrow-band excess can be parameterised in terms of S, which 
quantifies the excess compared to what would be expected for a 
source with a flat spectrum (Bunker et al. 1995). This parameter is 
similar to a standard signal-to-noise selection, and we chose to se- 
lect candidate emitters with S > 2.5. This is equivalent to a colour 
of: 

rriK - mH.si = -2.5 log [l - E510-°-*('=-'"«2si)] (i) 

Cambridge Astronomical Survey Unit (CASU) confirms consistent pho- 
tometry (M. Riello private communication). 



where c is the zeropoint of the narrow-band image and 5 is the pho- 
tometric uncertainty. The Ha Une flux Fna and equivalent width 
Wna can be calculated: 

Fna = AAHaSi^j — {a \ ^ 

1 — (AAHsSi/AA/f j 

and 

1^H„ = AAh.si /h.si-Zk 

JK — /h2Si(AAh2Si/AAk) 

where AAk, AAhsSi are the widths of the broad and narrow-band 
filters and /k and /H2S1 are the flux densities in the two filters. 
VFho is simply the ratio of the line flux and continuum flux den- 
sity, and will also include contribution from the adjacent [Nil] 
line. However, we calculate the expected Hq line flux by assum- 
ing F[Nn]/fHc« ~ 0.33 for extragalactic Hll regions (Kennicutt 
& Kent 1983) and correcting appropriately. Note that this may re- 
sult in conservative Ha fluxes, since for metal-poor galaxies this 
contribution may be lower 

In Figure 1 we present the colour-magnitude diagram with se- 
lection criteria. Some care must be taken in deciding on the selec- 
tion, and we briefly describe it here. Firstly, we only use narrow- 
band detections with detection significances of > 3a. It is not nec- 
essary that the source is also detected in the broad-band frame, and 
we assign an upper limit if there is no corresponding continuum 
detection. We define an initial selection criteria of E > 2.5 (this is 
a function of narrow-band flux and combined error due to sky noise 
in both the broad and narrow-band images, as described above). We 
account for bright foreground objects with steep continua across 
the K-haxiA by making use of the z — K colours available for all 
detections from the COSMOS archival z-band imaging (Capak et 
al. 2007). Their very red continua will result in a larger effective 
wavelength for the broad-band detections. We correct for this to 
make a better estimate of the continuum contribution to the narrow- 
band flux at the 2. 121/im. We measure — H2SI as a function of 
{z — K), and fit the data with a linear trend using a least squares 
fit. The slope of this line is then be used to correct the — H2SI 
colour We find that on average the correction factor to the narrow- 
band magnitude excess is 0.03(2 — K) . To account for narrow-band 
detections that have large significance but low equivalent widths 
(e.g. van der Werf et al. 2000), we enforce an additional selection 
criteria of Wna > 50A. This is an arbitrary threshold, but is cho- 
sen to reflect the general scatter about /sT — H2SI = (see Figure 
1). 

The Ha line-flux limit is the theoretical minimum that could 
be detected taking into accoimt all these selection criteria. The av- 
erage 3(7 Ha flux limit over the survey is 1 x 10~^® erg s~^ cm~^. 
As a final quality control check, we visually inspect each candidate 
in order to filter any remaining erroneous sources (for example, 
non-crosstalk associated artifacts, etc.). In Figure 2 we present the 
narrow-band field around three candidate emitters representing a 
range of detection significance (2.5 < ^ < 5). For each panel we 
show the narrow-band frame before and after continuum subtrac- 
tion using the broad-band image after suitably scaling to account 
for the relative filter widths. We detect a total of 180 sources meet- 
ing the narrow-band selection criteria across 0.603 square degrees. 

3.2 Broad band colours: further selection, redshift 
confirmation and line contamination estimates 

We must be certain that the selection in colour space is actually 
identifying the Ha line at 2; = 2.23 rather than another strong emis- 
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Figure 3. Broad-band colours of narrow-band selected objects from our survey, (left) iigr colours with the LBG and BM/BX selection criteria of Steidel et 
al. (2003) (right) BzK colours divided into high redshift starforming galaxies (sBzK) and high redshift passive galaxies (pBzK) and foreground galaxies, using 
the selection of Daddi et al. (2004). In both cases we compare to a 1-in-lOO sampling of the general colour distribution using photometry available for the 
COSMOS field. It is clear that the straightforward narrow-band selection outlined in §3. 1 is also detecting low redshift galaxies, and we require further selection 
with broad-band colours to identify these. However, the low redshift detections likely correspond to real emission lines, and we discuss this further in §3.2. 
Large points indicate our secure Ha emitters after employing our secondary colour selection outlined in §3.2 and we highlight the high significance (S > 5) 
candidates, noting that all of these fall in the sBzK and BX/BM selection regions. For clarity, we only show photometric uncertainties for Ha candidates. 



sion line at another redshift. At higher redshift, the most important 
contaminant is [OlIl]A5007 ai z = 3.23 (although these objects 
are interesting in their own right as possible examples of high-z 
AGN or star-forming galaxies). At low redshift, we are potentially 
susceptible to any strong line with an emission wavelength between 
Ha andH2Sl. 

The most robust way to confirm the selection is through 
follow-up spectroscopy which can unambiguously identify the 
lines. However, complete spectroscopic follow up (at least in the 
short-term) is unfeasible. A 'cheaper' alternative is to make use 
of extensive multi-band optical/near-infrared photometry available 
for COSMOS (Capak et al. 2007), which can be used to improve 
the Ha selection. Unfortunately, at z = 2.23 the current pho- 
tometric redshifts are not reliable enough to confirm Ha emis- 
sion (Mobasher et al. 2007), so we turn to the cruder method of 
colour/colour-selection to clean our high redshift sample. 

We therefore define a secondary optical selection, tuned ac- 
cording to the colours of a sub-sample of high significance (E > 5) 
Ha candidates from the pure narrow-band selection. To discrim- 
inate between high and low redshift emission line galaxies, we 
first use the sBzK selection first presented by Daddi et al. (2004), 
and now becoming popular to select active galaxies at 2 > 1.4. 
An alternative selection of high redshift galaxies is made with ugr 
colours, and has been used to search for z ~ 3 Lyman Break galax- 
ies (LBGs, Steidel et al. 1996) and 'BM/BX' star-forming galaxies 
at 2 ~ 2 (Adelberger et al. 2004, Steidel et al. 2003). We use the 
BM/BX selection as a further indicator that the narrow-band se- 
lected galaxies are at high redshift (additionally the LBG selection 
can also be used to detect possible 2 '-^ 3 [O III]A5007 emitters). 
Candidate Ha emitters are narrow-band excess objects which also 
satisfy the BM/BX selection or the sBzK selection. In a further ef- 
fort to eliminate foreground contaminants we impose a magnitude 



cut of 2 < 22.5 (equivalent to M„ < -23.7 at 2 = 2.23). This 
yields a total Ha sample of 55 compared to our original selection 
of 180 sources on the basis of narrow-band selection alone. We 
present the colour-colour plots with selection criteria in Figure 3. 
After the colour selection, the majority of Ha emitters fall in both 
the BM/BX and sBzK regions, with those that do not having large 
enough photometric errors that place them < la away in colour- 
space. Note that two objects flagged as Ha emitters are potentially 
low-redshift [Fell] A 1.64 emitters based on their photometric red- 
shifts (see Figure 4 and discussion below). Thus, the photometric 
selection is not a fool-proof method, although it suggests the con- 
tamination rate is low (< 3%). To estimate the number of galax- 
ies that could have been potentially missed by this further colour 
selection, we Monte Carlo our selection by allowing the photom- 
etry to vary randomly in a range defined by the (la) uncertainty 
on each point, then count the number of galaxies satisfying the Ha 
criteria and compare this to the nominal value. Repeating this pro- 
cedure many times builds up a measure of the the likely contam- 
ination/incompleteness as galaxies are scattered in and out of the 
selection regions. Our results suggest that the number of galaxies 
missed (or alternatively the interloper fraction) could be as much 
as 10%. Assessing this contamination more accurately will require 
follow-up spectroscopy, which at the time of writing, is underway. 

What are the remaining narrow-band excess objects which fail 
our colour selection? At least one appears to be a high redshift (pos- 
sibly [OIII]A5007) emitter selected in the LBG cut, but the bulk 
of the remaining objects are likely to be low-redshift given their 
colours. There are several other low redshift emission lines that 
could potentially be detected, and to investigate this, in Figure 4 
we plot the probability distribution P{z) of 2phot of (a) candidate 
Ha emitters, (b) Paa and Pa/3 candidates and (c) unclassified re- 
maining narrow-band excess detections. Using a conservative pho- 



6 J. E. Geach et al. 



tometric redshift uncertainty of dz — ±0.05 we identify 12 Paa 
and 6 Pa/3 candidates. In the unclassified redshift distribution there 
are two clear peaks in the distribution at z ~ 0.4 and z ~ 0.9. 
The low redshift emitters could potentially be [Fell] A1.6/im or sev- 
eral Brackett lines, whereas the slightly higher redshift peak might 
correspond to Pa7. Clearly this must be confirmed through spec- 
troscopy, but it appears that the narrow-band selection is detect- 
ing real line-emitters, but is not sufficient alone to isolate Ha at 
z — 2.23. Identification of lower redshift interlopers will become 
easier when more accurate photometric redshifts are available for 
COSMOS (Mobasher et al. 2007 & private communication). 

To check for contamination for higher-z line emitters, we use 
a ugr cut to select z ~ 3 Lyman Break galaxies (LBGs), and so 
identify possible [OlIl]A5007 contaminants (Figure 3). In our to- 
tal narrow-band selected sample, only one object satisfies the LBG 
selection criteria, and therefore we conclude that the z = 3.23 
[OlIl]A5007 contamination rate is negligible for our sample. We 
note that in a follow up spectroscopic survey of narrow-band iden- 
tified z = 2.2 Ha emitters by Moorwood et al. (2000), six of 10 
candidates were confirmed as 2 = 3.23 [OlIl]A5007. Thus it is not 
clear whether environmental effects are responsible for the high 
contamination observed in this previous small scale survey. 

Finally we assess the impact of contamination from AGN at 
z = 2.23 which would impact on our assessment of psFR from the 
Ha LF. We are probing the expected peak in the SFRD, but this is 
also associated with the peak of AGN activity (Shaver et al. 1996; 
Boyle & Terlevich 1998) - what is the expected contamination rate 
in our Ha selected sample? As with contamination from emission 
lines at different redshifts, without spectroscopic information (for 
example the Ha/[Nll] ratio discriminating between star-formation 
and nuclear activity), we must address this issue statistically. As 
described by van der Werf et al. (2000), the AGN interloper frac- 
tion is likely to be low: the fraction of AGN in local Ha selected 
galaxies is <5% (Gallego et al. 1995), and this is not likely to in- 
crease toward high redshift (Teplitz et al. 1998). On the other hand, 
Shioya et al. (2008) include a 15% correction for AGN contami- 
nation in the Ha luminosity function at 2 = 0.24, based on the 
results of Hao et al. (2005) who studied the Ha luminosity density 
from AGN in the Sloan Digital Sky Survey. We adopt this same 
15% AGN contamination in our measurement, resulting in a more 
conservative estimate of psFR- 

3.3 Spectroscopic comparison 

The zCOSMOS survey (Lilly et al. 2007) is an extensive spec- 
troscopic project obtaining thousands of redshifts in the COS- 
MOS field. To search for any existing spectra corresponding to 
our narrow-band excess sample, we cross-correlate our catalogue 
with the current zCOSMOS spectroscopic catalogue (S. Lilly priv. 
comm.). We do not expect a large degree of overlap between our 
catalogue and zCOSMOS, since we are detecting relatively faint 
galaxies, and indeed there were only five matches with secure red- 
shifts. One Ha candidate is included in the 'faint' zCOSMOS cat- 
alogue and confirmed to securely lie at z — 2.2329, verifying the 
selection at least for this candidate. The remaining matches are all 
in the 'bright' sample. Three of these were identified by us as be- 
ing possible Paa at 2 ~ 0.13, and indeed two have confirmed 
redshifts of 2 = 0.1312 and 2 = 0.1298, although another has 
z — 0.1086. The final match has 2 = 0.7340, and it is unclear 
if this corresponds to a real emission line. Although this is a very 
limited spectroscopic confirmation, it does suggest that we are de- 
tecting a range of emission lines, and that our secondary filtering 




Photometric redshift 

Figure 4. The summed probability distributions P{z) for photometric red- 
shifts: (top) nan'ow-band excess detections satisfying our ugrlBzK selec- 
tion. Note that there is a spike in P{z) at z ~ 0.3, corresponding to the 
redshift of [FeII]A1.64 in the H2SI filter. This spike represents two objects 
and these are removed from our sample. The broad distribution of Zpi^ot at 
z > 1 are probably real Hq emitters with poorly detennined photometric 
redshifts; (middle) distribution of galaxies selected as Paa and Pa/3 emis- 
sion lines with ZpjjQj^ = 0.13±0.05andZphot = 0.65±0.05 respectively; 
(bottom) remaining 'unclassified' emitters. There are two main peaks to the 
distribution of unclassified objects: at z ~ 0.3 which could correspond to 
[Fell] Al. 64 or several Brackett lines. At z ~ 0.8 the peak could correspond 
to Pa7. 

technique is useful for weeding out low redshift interlopers. We 
note that our near-infrared spectroscopic follow-up will allow us to 
make a much better estimation of the contamination rate. 



4 RESULTS 

We detect a total of 55 robust candidate z — 2.23 Ha emitters over 
0.603 square degrees of the COSMOS field, resulting in a surface 
density of 91 ± lOdeg"^ above a flux limit of 1 x 10^^'' ergs"^, 
equivalent to a SFR of 30 Mq yr~^ at this redshift. In the following 
sections we briefly discuss their multi-wavelength and morpholog- 
ical properties, and use the sample to evaluate the Ha luminosity 
function and thus star-formation rate density at 2 = 2.23. Finally, 
we make an estimate of the real space correlation length of Ha 
emitters by measuring their two-point angular correlation function. 

4.1 Ha luminosity density at 2 = 2.23 

4.1.1 Evaluating the luminosity function of Ha emitters 

To calculate the luminosity function of Ha emitters, we convert 
line fluxes to luminosities, correcting for 33% [Nil] contribution to 
the measured flux. The volume is reasonably well defined by the 
survey area and narrow-band filter width (50% cut-on/off: 2.111- 
2.132/im), which probes a co-moving depth of ~40Mpc, and is 
equivalent to 220xlO^Mpc'^ (co-moving). However the reader 
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should note that since the narrow-band transmission function is not 
a perfect top-hat, the effective volume probed will vary as a func- 
tion of intrinsic luminosity, with more luminous Ha emitters be- 
ing detected over a larger volume than fainter sources. Similarly, 
the incompleteness will vary strongly as a function of apparent lu- 
minosity, since faint Ha emitters at redshifts corresponding to the 
low-transmission wings of the narrow-band filter might be lost, and 
bright lines will have their luminosity underestimated, contribut- 
ing to the counts in lower luminosity bins. To estimate the impact 
of this effect on our measured results, we model these observa- 
tional effects on a 'true' model luminosity function. Our simulation 
consists of a set of model Ha emission lines convolved with the 
H2SI transmission function, where the lines are randomly placed 
in a redshift range corresponding to the full width of the filter. Af- 
ter convolving 1000 such lines per luminosity interval, we recover 
the effective volume probed, the incompleteness and luminosity 
biases (i.e. including intrinsically bright emitters dimmed by the 
edge of the transmission function falling in lower luminosity bins). 
The combination of these effects modifies the model luminosity 
function, the main change being a ~10% reduction in the faint- 
end slope a and a ~20% decrease in (j>* compared to a uniform 
transmission model. The effect on L* is negligible. As we describe 
below, since we fix the faint end slope a = —1.35, the difference 
in the integrated luminosity function above the simulated detection 
threshold is slightly less than if this was a free parameter, ^^10%, 
and we correct for this difference in our measurement of psFR- 

Although Hq is relatively insensitive to dust extinction (un- 
like Lya, it is not resonantly scattered), dust absorption must still 
be taken into consideration when deriving luminosities. A robust 
technique would be to obtain spectroscopy of each source to mea- 
sure the Balmer Ha/H/3 decrement, but this is not currently feasible 
for our sample. Instead we simply adopt Aua = 1 mag (Kennicutt 
1992; 1998), as is applied in similar earlier studies (Fujita et al. 
2003; Pascual et al. 2005; Ly et al. 2007) for SFR estimates. Al- 
though it is known that the specific extinction is a function of SFR 
(Jansen et al. 2001; Aragon-Salamanca et al. 2003; Hopkins et al. 
2001), for ease of comparison with other samples, we apply this 
constant correction to our sample, noting that in the cases where 
the actual extinction is higher, our measured SFRs will be conser- 
vative. We compare SFR indicators for our sources in §4.2.2 and 
confirm that the correction used here is appropriate. 

Our LF has been corrected for completeness on a bin by bin 
basis using the completeness functions estimated from the narrow- 
band simulation (§2.3), and the lower luminosity limits have been 
dealt with by performing a survival analysis on the binned data, 
using the probability estimator of Avni et al. (1980). Note that 
we apply a correction for the incompleteness caused by the shape 
of the H2SI transmission described above on the integrated lumi- 
nosity function (see §4.1.2). The errors are derived from the Pois- 
son statistics of each bin, although we apply a slightly more con- 
servative bootstrap analysis to estimate the uncertainties on the 
fit values, described in more detail below. We fit the LF with a 
Schechter function characterised by a, 0* and L*: <j){L)AL = 
(j)* {L/ L*)" exp(— L/L*)d(L/L*). Since we lack sufficient depth 
to fit the faint end slope, we fix a = —1.35, the value observed by 
Gallego et al. (1995) for their z — Ha study. This approach was 
also used by Yan et al. (1999) for the Ha LF sX z = 1.3. There- 
fore we cannot show any evolution in the faint end slope of the 
Ha LF - a deeper survey is required for this. In fact Reddy et al. 
(2007) find evidence for a steeper faint end slope for UV/optical 
colour-selected star-forming galaxies at high redshifts {z ^ 2~3). 
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Figure 5. Ha luminosity function from this work compared to those derived 
at z = (Gallego et al. 1995) and z = 1.3 (Yan et al. 1999) (these are not 
corrected for dust extinction). Errors are based on the Poisson counting er- 
ror and the completeness correction applied to each bin, derived from the 
detection rate of nan'ow-band selected sources. The luminosity function is 
fit with a Schechter function with a fixed faint end slope, a = —1.35 (iden- 
tical to Yan et al. 1999 and Gallego et al. 1995) - deeper observations will 
be required to probe the evolution of the faint end slope of the luminosity 
function. Our results show significant evolution from z = equivalent to 
an order of magnitude in luminosity evolution. Our LF is also very siinilar 
to the z = 1.3 Ho luminosity function, consistent with little or no evo- 
lution between z = 2.23 and z = 1.3, interpreted in terms of the star 
formation rate density as a peak or plateau occurring at z ~ 2 (see Figure 
6). We compare to the theoretical prediction of the luminosity distribution 
of Hq emitters from the Bower et al. (2006) variant of the semi-analytic 
GALFORM code (Cole et al. 2000). The observed counts agree well with the 
inodel, although there are hints that the faint end slope is slightly steeper 
than our assumed value. 

Adopting a a < —1.35 would obviously have a rather dramatic 
effect on extrapolated values for psFR, which we discuss in §4.1.2. 

To estimate uncertainties on the LF, we refit the Schechter 
function by bootstrapping, with the variance of the resulting pa- 
rameters forming our uncertainty in 0* and L* . We find (j)* — 
(1.45 ±0.47) X IQ-^Mpc-^* and log L* = 42.83 ± 0.13 erg s~\ 
We present the results in Figure 5 in comparison to lower redshift 
studies of the Ha LF z — 0, demonstrating the sharp increase in 
Ha emitters from z = 0-2.23. Our results are very similar to the 
luminosity distribution found by Yan et al. (1999), suggesting that 
there is little evolution of the bright end of the Ha LF across the 
2Gyrs between z — 2.23 and z = 1.3. 

4.1.2 The star-formation rate density at z — 2.23 

Next we use the integral of the LF to find the volume aver- 
aged star-formation rate density, psFR. In Figure 6 we present 
the evolution of psFR out to 2: = 2.23 using Ha only. In or- 
der to compare results, we take the LF parameters for Ha sur- 
veys compiled by Hopkins (2004) and integrate these down to 
Lua = lO**^ ergs~^, the approximate luminosity limit of the 
present survey. To convert Ha luminosity to a star-formation rate, 
after applying a factor 2.5 to Lhq to account for Ana ~ 1 mag. 
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Figure 6. The evolution of the star-formation rate density out to 2 = 2.23 using the Ha tracer alone, and compared to a heterogeneous mix of multi-wavelength 
tracers (all converted to the cosmological parameters used throughout this work). The data are collected taken from the compilation of literature data made by 
Hopkins (2004), and recent works of Shioya et al. (2008) and Viilar et al. (2008). Where corrected for extinction, are modified by a common factor Ajja = 
1 mag. In the top panel, we calculate the integrated Ha luminosity down to the approximate luminosity limit of the present survey, Ljjq > 10*^ erg s~^. Our 
results support a flattening of the psFR between In the bottom panel we apply the same integration range as Hopkins (2004), extrapolating down to 

Lhci > 10"^^ erg s~^. The general evolution of psFR is mirrored here, but the reader should be aware of the potentially strong impact uncertainties in the 
faint end slope have on the integrated luminosity function. Surveys of high-redshift star-forming galaxies have yet to satisfactorily constrain this parameter. 



we use the standard calibration of Kennicutt (1998); SFR[Ha] — 
7.9 X 10~*^(Z/Hci/ergs~^) Mq yr~^ (assuming continuous star 
formation. Case B recombination at Te = 10 K and a Salpeter 
initial mass function [IMF] ranging from O.l-IOOA/©). Correct- 
ing for the transmission incompleteness described in §4.1.1 and a 
15% correction for AGN contamination (Shioya et al. 2008), we 
find PSFR ~ 0.17^Q Qg A4q yi~^ Mpc~^ where the uncertainty is 
equivalent to the la uncertainties on the luminosity function fit pa- 
rameters. Our results support a flattening or peak in the level of 
star-formation activity between 2:~l-2. 

In Figure 6 we also show the evolution of psfr, but ex- 
trapolated to a much fainter luminosity, Lua > lO'^^ergs"^ 
(SFR > 8 X 10"'' M0 yr~^), in line with the psFR compila- 
tion of Hopkins (2004). We remind the reader of the uncertainty 
inherent in interpreting psFR estimates derived from such extrap- 
olations: the uncertainty in the slope of the faint end of the lumi- 
nosity function could have a dramatic effect on the result. For this 
extrapolated case (again, corrected for extinction), we find psFR ~ 
0.25j:o;f3 A/q yr~^ Mpc"^. This is in broad agreement with the 
range of psFR estimates (for a variety of tracers) at z — 2.23 
(Fig. 6) and confirms the plateau in psFR above z ~ 1. 

Our measurements of psFR are consistent with the results of 
Reddy et al. (2008), who estimate the Ha luminosity density at 
1.9 ^ z < 2.7 by using the correlation between dust-corrected 
ultra-violet and Ha SFRs (Erb et al. 2006). They find psfr ~ 
0.35 ± 0.09 Mq yr~^ over this (albeit larger) redshift range. Reddy 



et al. (2008) measure a steeper faint end slope for the ultraviolet 
luminosity density at 2: ~ 2, a = —1.88. If this behaviour is re- 
flected in the general star-forming population at this redshift, then 
we would also expect a steeper Ha faint end slope. This would in- 
crease the extrapolated psfr presented here. For example, fixing 
a — —1.65 (more in line with the theoretical luminosity distri- 
bution from Bower et al. 2006), psfr would increase by nearly a 
factor of 2. This should be taken as a cautionary point by the reader, 
when interpreting our extrapolated results. 

There is a final caveat regarding the luminosity function pre- 
sented in Figure 5; namely the treatment of dust. In this analysis 
we have assumed a uniform correction, Aua = 1 mag for ease of 
comparison with other results (we discuss the validity of this as- 
sumption in §4.2.2). Note that, in reality, this correction may be a 
function of luminosity, such that the most luminous systems require 
larger corrections. This would have most effect on the bright end of 
the LF, where we suffer the most from low-number statistics. 



4.1.3 Comparison to theoretical predictions of the luminosity 
distribution 

Recent recipes for galaxy formation using a semi-analytic pre- 
scription provide us with the opportunity to directly compare ob- 
served Ha line luminosities with theoretical predictions (Bower 
et al 2006). In Figure 5 we present the Ha luminosity function 
at 2; = 2.23 from the Bower et al. (2006) model (hereafter 'B06 
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Figure 7. 5" x 5" HST ACS thumbnail images of the 55 2: = 2.23 Ha candidates. The images are coloured with a map generated from the ground based 
Brz imaging, and use the F814W filter as a luminance channel (all images are scaled identically and orientated with north up and east to the left). Note that at 
2 = 2.23, 5" is equivalent to ~40 kpc in projection. The images are organised row-wise in order of decreasing Ha luminosity such that the top left thumbnail 
is the most luminous, and the bottom right is the least. There are two types of morphology: very compact or more extended (generally disturbed) systems. Many 
of the disturbed morphologies show evidence for two bright components within ~ 1" of each other - indicative of recent or ongoing interaction which might 
be powering star-formation. The colour images appear to support this, with some of the more extended sources showing clear red and blue parts, reminiscent 
of merging star-forming galaxies in the local Universe. 



model') which populates dark matter halos in the A'^-body Milli- 
Millennium simulation with model galaxies (1/16 the volume of 
the full Millennium simulation, Springel et al. 2005). The B06 
model is a variant of GALFORM, the Durham semi-analytic code 
(Cole et al. 2000; Benson et al. 2003), which includes a new pre- 
scription for AGN feedback. Such feedback appears to be very im- 
portant for preventing the cooling of baryons in massive galaxies, 
and thus steepening the bright end of the luminosity function (see 
Bower et al. 2006). We take the total Hq luminosity from the cat- 
alogue, and enforce the same equivalent width cut, Whq > 50Aas 
in our narrow-band selection. No other constraint is placed on other 
observable parameters. The observed luminosity distribution gen- 
erally appears to agree well with the predictions, although there 
appears to be a slight excess of bright Ha emitters in the model. 
The origin of this disparity could be caused by Cosmic variance 
effects or over-production of bright star-forming galaxies in the 
simulations, exacerbated by the low-number statistics in the high- 
luminosity bins. Nevertheless, the overall rough agreement of the 
Ha luminosity density over the observed range is encouraging. 

4.2 The properties of Ha emitters 

4.2. 1 Hubble Space Telescope ACS imaging - morphologies 

In Figure 7 we present Hubble Space Telescope (HST) ACS F814W 
thumbnail images of the candidate Ha emitters. The exquisite 
imaging from HST allows their rest-frame UV morphologies to be 
examined, and we have used this, along with BRz Subaru Suprime- 
Cam optical imaging from the COSMOS archive to preserve mor- 
phological information and reveal colour gradients that vary on 
scales comparable to ground-based imaging resolution. For exam- 
ple this allows us to distinguish red and blue components in an in- 



teracting galaxy. We observe four morphological types: compact 
and isolated galaxies (33 ± 8%); amorphous or disc morpholo- 
gies (35 ± 8%) and galaxies with obvious multiple components 
(27 ± 7%). Note that several of the galaxies have too poor S/N 
in the HST images to determine a classification (5 ± 4%). We note 
that the close-companion rate is only marginally above that for a 
similar if-band selected sample: 21% of 18 < < 19.5 galax- 
ies selected from the full COSMOS survey have an / < 25 mag 
neighbour within 2.5". 

The mean half-light radius (ri/2! an effective measure of 
galaxies' size) for the Ha emitters is ('"1/2) = 2.1 ± 0.5 kpc. 
How do these galaxies compare with other UV-selected systems 
at the same epoch? Erb et al. (2004) report morphologies and sizes 
for 13 'BX' galaxies at 2 ~ 2 in the GOODS-N field (although 
these galaxies were selected partially for their known elongation, 
see also Forster-Schreiber et al. 2006). The 'BX' galaxies appear 
to have qualitatively similar properties to many of the Ha selected 
galaxies in this work: clumpy, 'tadpole' features (see also Law 
et al. 2007). BX galaxies in the Erb et al. (2004) sample have 
ri/2 = 1.9±0. 5 kpc, implying they have similar sizes (and qualita- 
tively similar morphologies) to Ha-selected star-forming galaxies 
at the same redshift. 



4.2.2 Spitzer 24p,m and VIA 1.4 GHz observations - obscured 
star-formation 

The entire COSMOS field has been observed with the Multiband 
Imaging Photometer (MIPS) on the Spitzer Space Telescope. The 
24/im flux at 2; ~ 2.23 probes polycyclic aromatic hydrocarbons 
(PAH) emission lines. In dusty star-forming galaxies at this red- 
shift, 24^m should be dominated by the prominent l.lfim emis- 
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sion feature, and so should be a good indicator of obscured star- 
formation. To investigate the possibility that some of the Ha emit- 
ters are dusty luminous starbursts, we cross-correlate the Ha cata- 
logue with the Spitzer-COSMOS (Sanders et al. 2006) MIPS lAjjxa 
catalogues using a simple matching radius of 2". Seven Ha emit- 
ters from our sample of 55 are detected in both the wide and 
deep 24/um MIPS maps (flux limits of 0.3 mjy and 0.06 mjy re- 
spectively) with a noise weighted average 24/im flux density of 
(/24Mm) = 0.20 ± 0.03 mJy. 

In order to search for a statistical signature of mid-infrared 
emission from the whole sample of Ha emitters we stack the 24/im 
data in regions around the source positions (the Ha fluxes of 24/L(m 
detections are not significantly different from the overall Ha pop- 
ulation, so we include these in the stack). The resulting stacked 
image yields a significant flux excess, /24pm ~ 0.11 ± 0.01 mJy. 
The uncertainty is derived by stacking 50x55 thumbnail images 
around random locations in the 24/um map, and measuring the scat- 
ter in the fluxes measured in identical apertures in each stack. The 
significant 24^m flux suggest weak mid-infrared emission in these 
star-forming galaxies. Since COSMOS was also observed at 70/im 
and 160/xm with MIPS, we apply a similar stacking analysis for 
these wavelengths in an attempt to place constraints on the far- 
infrared properties of the Ha emitters. We do not find significant 
flux excess in either case, and so we define 3(7 upper limits of 
/tomih < 2.1 mJy, and /i60/im < 9.2 mJy. 

How does the infrared luminosity compare to the Ha lumi- 
nosity? To compare these two values we first estimate the total- 
infrared luminosity by normalising an M 82-like spectral energy 
distribution to the observed 24/im flux and integrating over 8- 
1000/xm (the choice of SED is motivated by the discussion in 
§4.2.3). The SFR is estimated using the calibration of Kennicutt 
(1998), SFR[IR] = 4.5 x IQ-^'' (Lg-ioooMm/ergs-i) M© yx-\ 
We find SFR[IR] = 90 ± 40MQyr"\ where the uncertainty 
is derived by recalculating the total infrared luminosity by intro- 
ducing a 50% variation in the normalisation of the SED, since 
at z = 2.23 the 24^m flux probes the 7.7/im PAH emission 
line, and the fraction that this band can contribute to the total in- 
frared flux can vary. How does this compare with the Ha measure- 
ments? Without applying any correction for extinction at Ha, we 
calculate the average SFR[Ha] = 40 ± 20 M© yr"^ (the uncer- 
tainty reflecting the Itj scatter in Ha luminosities, and assuming 
SFR[Ha] = 7.9 x IQ-"*^ (Lna/ergs^^) Mq yr-\ Kennicutt et 
al. 1998). Including the canonical extinction correction of 1 mag, 
this rises to SFR[Ha] = 100 ± 45M0yr"^ in good agreement 
with the SFR derived from the average infrared luminosity. As de- 
scribed above, the true level of extinction could be a function of 
luminosity, however this result suggests that our adoption of a con- 
stant reddening correction does not result in a significant underes- 
timate of the SFR. 

Four Ha emitters are formally detected in the 1 .4 GHz VLA 
map of COSMOS (Schinnerer et al. 2004), with an average flux of 
(/i.4GHz) = 160 ± 20/xJy. Again, these objects' Ha fluxes are 
not significantly different from the general population, and so we 
employ the same stacking technique to find the 'average' 1.4 GHz 
flux for the sample. We find /i.4ghz = 9.7 ± 1.2 /ijy (the er- 
ror is calculated in the same manner as the 24/im stack - i.e. re- 
peating the analysis for randomly co-added thumbnail images). Pa- 
rameterising the correlation between infrared and radio luminos- 
ity as (j24 = log(/24Mm/,fi.4 GHz) (sce Ibar et al. 2008) we find 
924 = 0.7± 0.2. This value is in good agreement with the expected 
924 for an M82 SED at z = 2.23, 924 = 0.6 ± 0.1 (Ibar et al. 
2008) , and so converting the radio luminosity to a far-infrared lu- 



minosity (and thus SFR) is consistent with our derived SFRs from 
the 24/im luminosity and extinction corrected Ha luminosity de- 
scribed above. 

4.2.3 Average spectral energy distribution of Ha emitters 

In Figure 8 we show the average spectral energy distribution of the 
Ha emitters. The optical/near-IR wfi VgnzAT+FSUW+IRAC 3.6- 
8/im photometry is averaged over all the Ha emitters (weighted 
by the individual photometric errors) and then fit with a choice of 
Sa-d and burst spectral templates (Bnizual & Chariot 1993) red- 
shifted to z = 2.23 using the hyperz photometric redshift code 
(Bolzonella et al. 2000). Note that the IRAC measurements are the 
3" aperture fluxes corrected to total flux using the values suggested 
by flie S-COSMOS team: 0.90, 0.90, 0.84 and 0.73 for the 3.6, 
4.5, 5.8 and 8/um cameras respectively. Coverage of the rest-frame 
near-infrared is useful for improving the photometric redshifts of 
high redshift objects, since it allows the location of the 1.6/im stel- 
lar bump to be tracked (e.g. Sawicki 2002). The wavelength of this 
feature corresponds to the minimum opacity of H~ ions in the at- 
mospheres of cool stars (John 1988), and thus is an indicator of the 
evolved (> 20Myr old) stellar population. The average 3.6-8/xm 
SED agrees with the 1.6/im feature redshifted to z = 2.23. The 
data are well fit by an Sc spectral type template with a foreground 
screen reddening of Ay = 1.0 mag (this parameter was free to 
vary in the range < Ay < 2). This suggests the UV contin- 
uum in these sources may be slightly more obscured than we are 
assuming for the Ha line emission. 

The good fit to the spectral template is compelling evidence 
that our narrow-band plus photometric selection is indeed identify- 
ing a population of z = 2.23 star-forming galaxies, and shows that 
improved photometric redshifts that make use of bands redward of 
K will help in improving the narrow-band selection (Mobasher et 
al. 2007). As IRAC is probing rest-frame near-infrared emission, 
we can estimate the stellar mass of these galaxies from the absolute 
rest-frame .ft'-band luminosity. We estimate the rest-frame abso- 
lute K-bwd magnitude from the 5.8/im flux, Mk = —25.4 mag 
(AB magnitude). The stellar mass, M*, is estimated using a similar 
method to Borys et al. (2005), assuming M* is equal to the inte- 
grated stellar mass at the age of the galaxy. Note that our template 
assumes a continuous star-formation history, with a Miller & Scalo 
(1979) initial mass function (a burst template does not provide a 
satisfactory fit to the observed photometry). If the /«'-band luminos- 
ity traces stellar mass, then = iQ-O '^iMK -3.3) /(^Lk/M) Mq 
where Lk /M is the mass-to-light ratio, and the factor 3.3 corre- 
sponds to the absolute solar 7f-band luminosity (Cox 2000). For 
continuous star-formation we estimate Lk /M from the description 
of Borys et al. (2005) who use the Starburst99 stellar synthesis 
code (Leitherer et al. 1999) to find Lk/M = 103(r/Myr)-" ''® 
for ages t > 100 Myr. The age returned from HYPERZ is 1 Gyr, 
similar to the mean age of z ~ 2 UV-selected star-forming galaxies 
in the survey of Erb et al. (2006). Adopting a realistic range in ages, 
0.5-2 Gyr we find M* ~ 6.1-11.5 x \Q^°Mq. This is larger flian 
the mean stellar mass of the sample in Erb et al. (2006), who find 
Af* = (3.6 ± 0.4) X 10^*^ Mq, but the disparity between the two 
mass estimates may be simply due to the different derivation meth- 
ods used: comparison of the spectral energy distributions to stellar 
synthesis models in Erb et al. (2006) versus simple age-dependent 
mass-to-light ratios here. 

In addition to the UV-near-IR photometry, we also show on 
Figure 8 the longer wavelength data described above: 24/im, lOpm, 
160/ttm and 1.4,GHzcm statistical detections and limits. For com- 
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Figure 8. The average SED for Ho emitters based on UV-radio photometry 
from the COSMOS archive averaged over all 55 candidates. In the inset we 
fit these with a Sc galaxy template at z = 2.23 reddened by Ay = 1 mag. 
The fit agrees well with the average SED, suggesting that our narrow- 
band selection is indeed identifying star forming galaxies at z = 2.23. 
The main panel extends the SED to longer wavelengths by stacking the 
24/im, 70/^m, 160^m and 1.4 GHz data (the 70/im and 160/jm points are 
3iT upper-limits), and we overlay the M 82 template, redshifted to 2 = 2.23 
and scaled in flux accordingly. Again, this seems to verify the redshift se- 
lection, and suggests that the galaxies we select are similar to local starburst 
galaxies in their broad SEDs. 



parison we show the M 82 galaxy SED (a local starburst) red- 
shifted to z = 2.23 and scaled in flux to match the 24/im flux 
density accordingly. The average 24/im/1.4GHz flux is consistent 
with what would be expected for a typical star-forming galaxy at 
z = 2.23 (Boyle et al. 2007; Ibar et al. 2008). This suggests that 
on average these galaxies are unremarkable starbursts, with SFRs 
< 100 M0 yr^^, with properties broadly similar to the most ac- 
tively star-forming galaxies found in the local Universe. We note 
however, that there is a departure from the similarity to M 82 in the 
rest-frame UV, with our composite SED showing quite significant 
UV-excess over what would be expected from M 82 at this redshift 
(Ibar et al. 2008). We interpret this as indicating that perhaps on av- 
erage, the observed Ha emitters are not as dusty as the local LIRG, 
or they simply have higher escape fractions of UV photons. 



4.3 Clustering of Ha emitters 

4.3.1 Evaluating the clustering strength 

Estimating the clustering properties of star-forming galaxies can be 
a powerful tool in understanding their formation and evolution, be- 
cause the clustering strength provides information about the dark 
matter halos these galaxies reside in. The descendants (and progen- 
itors) of these galaxies can then be traced by analysing the evolution 
of the halos themselves. Despite our relatively small sample size, 
we note that it is selected in a relatively narrow redshift range with 
little dilution of the projected clustering, so we attempt to estimate 
the clustering strength of Ha emitters using the angular correla- 
tion function, lu{9), from which we can derive the physical correla- 




Figure 9. The two-point angular correlation function of Ha emitters, 
uj{9). The data are fit with a power law Ad^'^'^ , and we find A = 
41.8 it 5.0 (for 9 in arcseconds), corresponding to a correlation length 
ro = 4.2^q'2 Mpc. The insets show the likely uncertainty on the fit. 
The histogram shows the distribution of amplitudes when uj(d) is recalcu- 
lated for a different realisation of the random catalogue (this was repeated 
1000 times). The contour plot represents when the fit is allowed to vary 
both the amplitude and slope /3 of the power-law fit, thus demonstrating the 
covariance dependence on the uncertainty in A. We find an adequate fit is 
obtained without varying /3, and therefore in our estimates we fix it at the 
fiducial value of /3 = —0.8, but it appears a better fit would be found for 
/3 ~ — 1, and a larger amplitude. The effect of this increase in amplitude 
and /3 would correspond to a slight decrease in tq: A = 95, /3 = — 1 cor- 
responds to ro ~ 3.6 h^^ Mpc; in better agreement with the measurement 
from the simulation (§4.3.2). Thus our current estimate of the clustering 
strength might be a slight overestimate. These are preliminary findings - a 
better estimate requires a larger sample. However, these initial results sug- 
gest moderate clustering on scales of 10-100" (i.e. sub-Mpc or sub-halo). 



tion length, ro. We use the estimator proposed by Landy & Szalay 
(1993): 



m = 1 + 



NrV DD{e) Nr DR{9) 

Nd J RR{e) Nd rr{9) ■ 



(4) 



Here DD{6) is the number of pairs of real 'data' galaxies within 
{6, 9 + 59), DR{9) is the number of data-random pairs and RR{9) 
is the number of random-random pairs. Nr and No are the num- 
ber of random and data galaxies in the survey. The random cat- 
alogue is generated by uniformly distributing 20Nd false galax- 
ies over a geometry corresponding to the survey field-of-view, 
which is appropriate for the small sample. The errors for each an- 
gular interval (spaced at 0.25 dex) are estimated with 5u{9) = 
(1 -\- u]{9))/ \J DD{9), which estimates the Poisson noise in each 
bin (Landy & Szalay 1993). We present our results in Figure 9, 
and fit the data with a power-law of the form A9~^'^ , where A is 
the amplitude of the angular correlation function, and the power- 
law index —0.8 is the fiducial value. For 9 in arcseconds, we find 
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A = 41.8 ± 5.0. The uncertainty is the Icr range of A when the 
fit is repeated for 1000 realisations of ol}{9), with different random 
catalogues. The integral constraint, C, which accounts for bias in- 
troduced by only observing a small region of the sky, will result in 
a reduction of the amplitude of the two point correlation function: 
yl(6'"'' ** - C) . We estimate the effect following Roche et al. (2002), 
and find it to be small for our survey area, with C — 0.0055 for 6 
in arcseconds - we include it when fitting the amplitude A. 

A more useful description of the physical clustering of galax- 
ies is the real-space correlation length, ro, and this can be calcu- 
lated directly from the angular correlation function using the in- 
verse Limber transformation (Peebles 1980; Efstathiou et al. 1991) 
which relates the spatial correlation function to the angular corre- 
lation function (provided the redshift distribution is known). 

Using clustering simulations described below, we have mod- 
eled the effect of the Gaussian filter selection function on our clus- 
tering measurements. We compare the projected clustering ampli- 
tude, A, derived for the population in a uniformly selected volume 
(i.e. assuming a top-hat transmission function) to that derived for 
samples selected with a luminosity-dependent selection function 
appropriate for our filter transmission curve. We find no statistical 
difference when including a luminosity-dependent volume correc- 
tion, with (ylcaussian) — (^uniform) ~ 45, uslng 100 slmulatioHS 
for each method. Therefore, for simplicity in the following depro- 
jection analysis we treat the projected correlation amplitude as if 
it came from a uniformly selected sample over the narrow redshift 
slice al z — 2.22-2.25. We also assume that the real space cor- 
relation function is independent of redshift over this narrow range 
(this is the same approach used in other high redshift narrow-band 
surveys, e.g. Kovac et al. 2007). 

We find ro = 4.21q'2 Mpc, where the uncertainty is de- 
rived by reevaluating ro after applying the l-a uncertainty on the 
fit to oj{6) described above. This likely underestimates the true er- 
ror due to added effects such as cosmic variance, which the reader 
should note could be an important effect in our 220 X 10^ Mpc^ 
survey and to assess this we now turn to our simulations. 



4.3.2 Comparison to theoretical predictions of the chistering 
properties of star-forming galaxies 

In addition to the luminosity distribution predicted by semi-analytic 
recipes (§4.2.2), we can also compare observed clustering proper- 
ties of Ha emitters to theoretical predictions from these same mod- 
els. The B06 model populates dark matter halos within a ACDM 
framework cosmology, where the 'observable' properties such as 
optical luminosity are dictated by a physically motivated prescrip- 
tion for the behaviour of baryons cooling within the halos. Thus, 
we can directly evaluate ro for Ha emitters within the simulation 
volume. 

Taking the B06 model, we limit simulated luminosities to the 
range 42 ^ \ogLy_a (ergs~^) ^ 44 (approximately the range 
of observed luminosities) and Wna > 50A, and then project the 
positions of the galaxies onto a 2D 'sky'. We then measure the 
amplitude of the two-point angular correlation function and con- 
vert to ro in exactly the same way as before (§4.3.1). We find 
ro = 5.7^0 7 Mpc, slightly larger than our observed value (but 
within ~2cr). Part of this disparity could be due to the slight ex- 
cess of bright Hq emitters in the simulations: the clustering of Ha 
emitters in the z — 0.24 study of Shioya et al. (2008) was seen to 
increase with Lua ■ The reader should also take into account the rel- 
atively small size of our survey (despite it being the largest to date) 
compared to the effective area of the Milli-Millennium. Thus, there 
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Figure 10. Evolution of the correlation length ro in co-moving Mpc for 
a variety of galaxy populations over z = 0-4. Literature points include 
submm galaxies (Blain et al. 2004); QSOs (Croom et al. 2005); LBGs 
(Ouich et al. 2004; 2005); BM/BX galaxies (Adelberger et al. 2005), and 
local Ha selected galaxies (Shioya et al. 2008). We relate the clustering 
properties to the expected clustering from models of the clustering of dark 
matter halos, assuming a model for the evolution of bias (Moscardini et al. 
1998). Our results suggest that Ha emitters at z = 2.23 are hosted by dark 
matter halos of mass ^lO^^ Mq - in good agreement with the masses of 
halos expected to host similarly selected galaxies at the same redshift (i.e. 
BM/BX galaxies). Thus, these Ha emitters appear to be the progenitors of 
Milky Way-like galaxies at z = 0. 



are additional uncertainties on our estimate of ro due to the likely 
field-to-field variations which are still significant on this scale. 

One way of illustrating this is to randomly select sub-volumes 
in the Milli-Millennium box corresponding to our survey volume. 
Projecting galaxies selected in these sub-volumes allows us to eval- 
uate the amplitude of their angular correlation function in an identi- 
cal way to that performed in §4.3.1. The scatter in amplitude (fixing 
the power-law slope to the fiducial value, —0.8, as before) for a set 
of random positions is a good indicator of the expected Cosmic 
variance on these scales. Statistically we find a similar amplitude 
to the real data, (A) ~ 45 ± 31 (for 9 in arcseconds), but with 
large scatter from the field-to-field variations. Thus, the dominant 
source of uncertainty in our measurement of the clustering strength 
of these objects is Cosmic variance, exacerbated by the relatively 
small numbers of objects in our sample. 



4.3.3 Halo mass 

The bias parameter describes how the observed galaxy distribution 
traces the underlying matter distribution. Assuming a model for the 
evolution of bias over cosmic time, it is possible to estimate the host 
halo mass of different galaxy populations by comparing their cor- 
relation length, ro to that of galaxies within halos of various mass. 



More importantly, this analysis can provide clues to determine the 
progenitor populations of galaxy populations in the local Universe. 
In Figure 10, we compare our results to predictions of ro{z) for 
dark matter halos with a fixed minimum mass dark matter halos of 
mass Mmin > lO""^''* Ir^ Mq (Matarrese et al. 1997; Moscar- 
dini et al. 1998). This has become a popular way to present the re- 
sults of clustering analyses in a cosmological context (e.g. Overzier 
et al. 2003; Blain et al. 2004; Farrah et al. 2006). We assume a 
ACDM cosmology, and an evolving bias model b{z) (Moscardini 
et al. 1998). We use the values tabulated by Moscardini et al. (1998) 
for various fixed minimum mass halos. We evaluate the predicted 
ro of galaxies by solving ^gai(«) = £'(a)^6(«)^^DM(0), where 
D{z) is the growth factor and ^ is the real space correlation func- 
tion. 

Our results suggest that Ha emitters at 2: = 2.23 reside in 
moderate mass halos, with M^in of order 10^^ Mq, similar to 
that of a Milky Way mass halo at z = 0, suggesting that these 
high redshift star-forming galaxies are the progenitors of galax- 
ies seen today. Given that the semi-analytic model is tied directly 
to the underlying dark matter halos it is possible to extract the 
halo masses that the B06 Ha emitters actually reside in. Limit- 
ing the model luminosities to the observed range, we find halo 
masses spanning logMhaio (Mq) = 11.2 — 12.9. with a median 
of log Afhaio (Mq) = 11.7. This is in reasonable agreement with 
the halo mass estimate from the observed deprojected angular cor- 
relation function. 

In comparison to other active populations at the same epoch 
(QSOs, SMGs and UV-selected BX/BM galaxies), wc see that Ha 
emitters typically reside in lower-mass halos than SMGs and QSOs 
and are not directly related to them, but the Ha emitters have a sim- 
ilar clustering strength to BX/BMs at the same redshift (Fig 10). 
This is perhaps not surprising, given the overlap in selection be- 
tween the two populations (Fig. 3), and suggests moderate cluster- 
ing of star-forming galaxies at this redshift. 



5 SUMMARY 

We have presented results from the largest near-infrared narrow- 
band survey for Ha emission at z = 2.23 yet undertaken, 
probing 220 x 10^ co-moving Mpc^ to a Ha line flux of 1 x 
10~^® ergs~^ cm~^. Selecting potential z = 2.23 Ha emitters 
on the basis of the significance of their narrow-band excess at 
2.121/im and broad-band colours and magnitudes, we detect 55 
galaxies over 0.603 square degrees in the COSMOS field, a vol- 
ume density of (2.5 ± 0.3) x 10~'*Mpc~^. Our findings can be 
summarised as follows: 

• The luminosity function is well fit by a Schechter func- 
tion with 0* = (1.45 ± 0.47) x 10"^Mpc~^, logL* = 
42.83 ± 0.13 erg s^^ assuming a faint end slope of a = —1.35, 
as found by Gallego et al. (1995) for the Ha LF at z = 0. We 
find strong evolution in the Ha luminosity function from z = 0- 
2.23 equivalent to a 10 x increase in characteristic luminosity. The 
z — 2.23 LF is very similar to the z — 1.3 distribution of Yan et 
al. (1999), implying little evolution in the population between these 
epochs. We note however that deeper observations will be required 
to probe the faint end of the LF, which could impact on our estimate 
of the integrated psfr. 

• The extinction corrected SFRD determined from the inte- 
grated LF is O.nto ll Mq yr-i Mpc"^ for Lhc > 10^^ erg s~^ 
2 = 2.23 (including a 15% correction for AGN contamination to 
the luminosity density). This result for the first time directly traces 
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the evolution of psFR out to z = 2.23 using Ha alone, and sup- 
ports the view that the SFRD undergoes a plateau at z ~l-2 prior 
to sharp decUne to the present day. 

• The Ha emitters have infrared and radio properties similar to 
that expected for local star-forming galaxies: our statistical mea- 
surement as of the 24/xm and 21 cm flux densities are consistent 
with a LIRG SED at z = 2.23, with the infrared derived SFR 
agreeing with the Ha derived rate if we assume the canonical Ha 
extinction, Aua = 1 mag. However, in comparison to M 82 (an 
archetypal LIRG), we observe a UV light excess over the tem- 
plate, which suggests that these galaxies are not as obscured as 
M 82, or at least the escape efficiency of UV photons is higher 
in these galaxies. HST-ACS imaging reveals that the Ha emitters 
have a range of morphologies including compact and disturbed sys- 
tems: several comprise of two components separated on scales of 
< 10" kpc, indicating recent or ongoing merger activity, as might be 
expected for starburst galaxies. Quantitatively the rest-frame UV- 
morphologies of the Ha emitters are similar to UV-selected star- 
forming ('BX') galaxies at the same redshift, with mean half-light 
radii of ri/2 = 2.1 ± 0.5 kpc. 

• We have performed the first clustering analysis of Ha emit- 
ters at z = 2.23. First, we measure the amplitude of the two- 
point angular correlation function uj(6), and find A = 41.8 ± 5.0 
for 6 in arcseconds. The inverse Limber transform provides us 
with the real-space correlation length ro, which we find is ro = 
4-2lo 2 Mpc. Comparing this result to other galaxy popula- 
tions over cosmic time, these high redshift star forming galaxies 
have similar clustering properties to similarly selected galaxies at 
high and low redshift, and reside in dark matter halos of masses of 
~10^^ Mq. This is agrees with the range of masses of host halos 
of Ha emitters selected from the Millennium simulation. Our Ha 
emitters appear to be slightly less strongly clustered than QSOs 
and SMGs, but similar to BX/BM galaxies at this redshift. Com- 
parisons to models of clustering evolution taking into account bias 
evolution, our results suggest that these galaxies will likely evolve 
into L* galaxies, similar to the Milky Way at 2 = 0. 

These observations represent the first results of an innova- 
tive survey for emission line objects over several square degrees 
at z = 0.84, 1.47 and 2.23 corresponding to narrow-bands in ,7, H 
and K: the Hi-Z Emission Line Survey (HiZELS) on UKIRT. The 
outcome of this survey will be large samples (~1000 at each red- 
shift) of identically selected 'typical' star-forming galaxies within 
well defined volumes. The large area will enable us to place even 
stronger constraints on the star formation history over this cru- 
cial epoch from identically selected star-forming galaxies in each 
redshift slice. Our custom-made J and H narrow-band filters at 
1.204/im and 1.618/im respectively will not only detect Ha emis- 
sion at z = 0.84 and z = 1.47, but also [O II]A3727 and 
[O III]A5007 emission lines from sources at z = 2.23, exactly the 
same redshift as the current 2.121/um survey. This will be a useful 
test for confirming the redshifts of a subset z = 2.23 Ha emitters 
by matching to detections in these other narrow-band filters. It will 
also provide a more complete survey of the emission line popula- 
tion of star-forming galaxies and AGN at this epoch. 
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